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MICROSATELLITE MARKERS FOR IDEtaTIFYING 
CANINE GENETIC DISEASES OR TRAITS 

FIELD OF THE INVENTION 

This invention relates generally to genetic markers and methods of making 
5 and using such markers, and more particularly, to a microsatellite marker that may 
be used to detect copper toxicosis in canines. 

BACKGROUND OF THE INVENTION 
Due to inbreeding and the relatively shaltovy^ gene pool, a large number of 
genetic diseases are present in dogs (Clark. R.D. et af., Medical and Genetic 
10 Aspects of Purebred Dogs (Forum Publications, Fairway, KS) (1994) and Robinson, 
R., Canine Pract. 16:29-34 (1991)). Some of these genetic diseases such as copper 
toxicosis in the Bedlington terrier breed, are so prevalent in a particular breed that 
the mutant allele frequency may be higher than that of the normal allele (Herrtage, 
M.E. etal., J. Small Anim, 28:1141-1151 (1987); and Yuzbasiyan-Gurkan, V. etal., 
16 Genomics 15:86-90 (1993)). Other genetic diseases cross many breeds, as 
exemplified by progressive retinal atrophy causing blindness (Bamett, K.C., Adv. Vet. 
ScL Comp, Med. 20:9-67 (1976)) and hip dysplasia resulting in painful and crippling 
arthritis (Corley, EA. Small Anim. Pract. 22:570-593 (1992)). 

Canine copper toxicosis (CT) is an autosomal recessive genetic disorder of 
20 copper accumulation which results in severe liver damage. Unless specific* anti- 
copper treatment is instituted, affected dogs die by three to seven years of age. 
White reported in several breeds, it is best characterized in Bedlington terriers, with 
the frequency of the defective gene being estimated at 50%. The disease is also 
prevalent in the West Highland White Terrier and Keeshond. 
25 Currently, the only method for diagnosing affected CT dogs is by a 

quantitative liver copper assay in a liver biopsy sample, after one year of age. 
Unfortunately, heterozygous and homozygous normal animals are indistinguishable 
from each other by this test, in order to determine rf a dog is a heterozygous carrier, 
test-breeding strategies must be employed which require that there be a dog of a 
30 known genotype to breed against the potential carrier This process is very costly 
and results in the birth of many affected individuals. It is therefore impractical for 
breeders to identify breeding stock free of the gene and currently carriers of the 
gene are only identified after they are found to be the parents of an affected dog. 
Because like CT, many of the canine genetic diseases are recessive, various 
35 methods have been investigated which would identify, on a molecular level, 
phenotypically normal carriers. One method that has been employed is the whole 
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gene subtraction method. This approach requires the sorting out of differences 
between DNA from those with or without the disease or trait with molecular 
manipulation methods. Unfortunately, this technique is somewhat impractical and 
requires that all variability within individuals with the trait as well as the variabiJity 
5 within those without the trait Independent of the trait, be differentiable from the one 
or few that are dependent on the trait. Furthermore, this method has only been 
demonstrated on very simple organisms such as yeast, and while this technique 
appears theoretically possible for higher species, it rapidly becomes impractical, as 
it requires many breeding studies of large numbers of affected animals. 
10 An alternative method, the use of restriction fragment length polymorphisms 

(RFLP), is extremely labor intensive and expensive with respect to both 
characterization and analysis. Furthermore, this technique requires large quantities 
of DNA. generally is limited to only two alleles, and only a few loci have thus far 
been characterized for the canine genome. It appears that with this method, a 
15 separate genetic system must be generated for each breed of dog, and such a 
library may not be sufficiently variable in most situations of interest. 

The randomly amplified DNA fragment length polymorphism (RAPD) 
approach uses random primers to amplify fragments of genomic DNA that vary from 
individual to individual within a species. While the primers are relatively easy to 
20 generate, the method is unreliable with minor experimental changes resulting in the 
resolution of different DNA band patterns. Furthermore, only a few such bands have 
been characterized for the canine genome. 

The candidate gene method is another alternative wherein one or more 
candidate genes is identified based on what is known about the biochemical and 
25 dinicai or other phenotypic attributes of the disease or trait and information about 
similar conditions in another species where a gene has been identified for a similar 
trait. This approach was taken in evaluating genes linked to the Wilson's disease 
gene in humans, a disease similar to CT. Unfortunately, the genes linked to the 
Wilson's disease in humans were not linked to CT in dog (Yusbasiyan-Gurkan, V, 
30 et aL, Genomics 15:86-90 (1993)). Thus, even under the best-case scenario, the 
candidate gene method is merely a guess and the approach is of course, further 
limited by the availability of identified genes. 

Because canine pedigrees for various genetic disease are abundant, with 
several gen rations and two or more affected members pr sent in many cas s, 
35 these pedigrees lend themselves to linkage studies, provided polymorphic markers 
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are available. Since most of the breeding is controlled, identfficalion of linked 
markers would allow concerned breeders to greatly reduce the incidence of these 
diseases in future generations. 

One type of marker that has been developed consists of simple sequence 
6 length polymorphisms (SSLPs). SSLPs arise from a varying number of repeats of 
a simple sequence, such as a dinucleotide repeat at a given locus, and have been 
reported to be frequent in most eukaryotic genomes (Tautz. D. et al., Nucleic Acids 
Res. 12:4127-4138 (1984)). Such loci, also referred to as microsatellites (Tautz, D., 
EXS: DNA Fingerprinting: State of the Science 1:21-28 (1993)), are best exemplified 

10 by those containing the (CA)„ motif and are found to be highly polymorphic in many 
species and are being successfully used in the construction of genetic maps of the 
human (Weissenbach, J. et al., Nature 359:794-801 (1992)). mouse (Dietrich, W, et 
al.. Genetics 131:423-477 (1992)). rat (Serikawa. T. et aL, Genetics 131:701-721 
(1992)) and bovine (Barendse, W. et al, Nat Genet, 6:227-235 (1994)) genomes, 

15 High polymorphic information content and amenability to analysis by polymerase 
chain reaction (PCR) and thus to possible automation, make microsatellites excellent 
linkage and mapping tools. 

CA microsatellites from the canine genome have been identified and their 
polymorphism evaluated on sets of unrelated dogs (Holmes, N.G. et al., Anim, 

20 Genet, 24:289-292 (1992)) or mixed bred dogs and beagles (Ostrander, E.A. et al.. 
Genomics. 16:207-213 (1993)). Presently there are about 150 SSLP-type markers 
for the canine genome available. Unfortunately, these known markers lack the 
ability to detect a linked marker for any genetic trait, because of the low probability 
of finding a linked marker sufficiently dose to a given genetic locus, to ensure 

25 detection. Many purebred dog populations have a relatively high level of inbreeding 
wrtiich makes it important that such markers be very polymorphic. Further, Important 
genetic diseases occur across many dozens of breeds, requiring the markers be 
polymorphic in most, if not all, breeds with many different breeds having varying sets 
of genetic problems. 

30 It would thus be desirable to provide a method for Identifying genetic diseases 

and traits In canines. It would also be desirable to provide a method for identifying 
genetic diseases and traits in canines which has high variability and low breed 
specificity. It would further be desirable to provide a method which allows breeders 
to select and breed for certain favorable characteristics, or conversely, to avoid 

35 unfavorable diseases and traits. It would further be desirable to provide a method 
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which allows the detection and screening of a recessive genetic disease such as 
copper toxicosis, which is phenotypically undetectable in heterozygote earners. It 
would further be desirable to provide a method for identifying a carrier of a genetic 
disease or trait and affected individuals without undergoing test-breeding 
5 experiments. It wou(d also be desirable to provide genetic markers for the canine 
genome. It would further be desirable to provide a marker for the CT gene in 
canines, 

SUMMARY OF THE INVENTION 

A set of microsatellite markers are provided which are useful in identifying 

10 linked mai1<ers for canine genetic diseases and traits. In particular, five hundred and 
nineteen microsatellite DNA markers are provided which are highly variable within 
and across many breeds of dogs. The microsatellite markers are derived ft'om 
regions of genomic DNA which contain a repeated motif e.g., (CA)„, flanked by 
unique sequences. The number of units contained within the repeat motif is 

15 variable, such that various different alleles are present in any given population. The 
unique flanking sequences may be used as polymerase chain reaction (PCR) 
primers which allows for the rapid amplification and characterization of each locus 
from a small amount of DNA, Thus, each microsatetiite marker has a unique set of 
primers. The microsatellite markers and their progeny are especially useful in 

20 detecting genetic diseases not phenotypically visible and identifying carriers of 
recessive diseases. In a preferred embodiment, microsatellite markers are provided 
which may be used to detect the canine copper toxicosis gene. 

In addition to identifying canine genetic diseases such as copper toxicosis, 
the microsatellite markets may also be used to create a genetic map of the canine 

25 genome, generate specific breed profiles, settle parentage disputes and identify dogs 
by DNA fingerprinting. Pedigrees of affected individuals, their siblings, parent and 
progeny can also be created. Breeders and owners can thus choose breeding stock 
thereby reducing and possibly eliminating the incidence of specific genetic 
diseases. 

30 Additional objects, advantages, and features of the present invention will 

become apparent form the following description and claims taken in conjunction with 
the accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The various advantages of the present invention will become apparent to one 
skilled in the art by reading the following specification and by referencing the 
following drawings in which: 

Figure 1A is a bar graph showing the average and standard deviation of 
heterozygosity percentages across loci within a breed; 

Figure IB is a bar graph showing the average and standard deviation of 
heterozygosity percentages across breeds within a locus; 

Figures 2A-2D are photographs of gels showing marker Jocus D02011 in 
various breeds; and 

Figure 3 is a photograph of a gel showing segregation of alleles at the 
C04107 locus in a Bedlington temer pedigree. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Five hundred and nineteen microsatellite markers from specific gene loci are 
provided which are highly variable within and across many breeds of dogs. The 
microsatellite markers of the present invention comprise a repeat motif e.g., (CA)^, 
found in the canine genomic DNA, flanked by unique sequences. The unique 
sequences (also referred to herein as primer pairs) may be used as PGR primers, 
allowing the rapid amplification and thus detection of the sequence of interest in a 
small DNA sample. Table 2A sets forth the microsatellite markers of the present 
invention. The microsatellite markers and their progeny are especially useful in 
detecting genetic diseases not phenotypically visible and Identifying caniers of 
recessive diseases. 

In a preferred embodiment, microsatellite markers are provided which may 
be used to detect a earner of the canine copper toxicosis gene. As further set forth 
in Specific Example II below, marker locus C04107 may be used to predict the 
inheritance of alleles at the copper toxicosis locus. C04107 has also been used to 
locate two other marker loci C04107B and C04107C. which either singly, or as a 
group, may also be used to detect the copper toxicosis gene. 

The method of the present invention is useful for identifying disease free 
individuals (homozygous normal), carriers (heterozygous) and affected individuals 
(homozygous affected) at any stage of development. While a single marker may fail 
to provide the required information In any particular pedigree, a series of progeny 
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markers will, and thus such a family of progeny markers derived from the linked 
markers set forth herein, are also included in the invention. 

SPECIFIC EXAMPLE I 
Materials and Methods 
5 isolation and Cijaracterizat/on of Miorosateliite Loci, Established protocols 

were used for the cloning and screening procedures as described (Sambrook^ J. et 
a!., Molecular Cloning, A Laboratory Manual (2nd ed. Cold Springs Harbor: Cold 
Springs Harbor Laboratory Press) (1992)). Genomic DNA was isolated from a 
peripheral blood sample from a Labrador retriever and partially digested with Bam 

10 HL Size selected fragments purified from agarose gels using QIAEX beads (Qiagen 
Corp., Chatsworth, CA) were cloned into the phagemid vector pBS (Stratagene, La 
Jolla, CA) to construct a library of average Insert size of 600 bps and propagated in 
the host XL-1 blue. The library was plated at tow density (about 500 coionies/piate) 
without amplification. Duplicate nitrocellulose colony lifts were prepared, denatured 

16 and hybridized with {CA)^^ oligomer, labeled with ^^P dCTP using terminal 
transferase. Positive colonies were picked with a sterile pipette tip and lysed in 50 
/#! of a solution consisting of 1% Triton X 100, 20 mM Tris and 2 mM EDTA. Using 
primers complementary to the T3 and 17 promoter sequences which flank the 
cloning site, the inserts were amplified ft^om 1-2 //I of the colony lysate in polymerase 

20 chain reactions for 30 cycles of 94, 55 and 72**C at 1, 2 and 3 min., respectively 
after an initial denaturation at 94**C for 4 min. The standard buffer, nucleotide and 
primer concentrations were 50 mM Tris-HCI (pH 8.3 at 25**C), 50 mM KCI, 1.5 mM 
MgCI^, 200 /iM dNTPs and 40 pmoles of each primer in 100 //i reactions. PCR 
reactions were carried out on either a Perkin-Elmer Cetus (Perkin Elmer, Corp. 

25 Nonwalk, CT) or an MJR PTC-100 thermocycler (MJ Research. Watertown, MA). To 
carry out secondary screenings of the clones, aiiquots of the amplification products 
were run on 1 .5% agarose TBE gels (90 mM Tris, pH 8.3, 90 mM boric acid, 2 mM 
EDTA). Southem blot analysis was earned out on the gels after transfer to 
Gene-Screen Plus membranes (NEN. Boston, MA) using the alkaline transfer 

30 protocol. The membranes were probed with (CA),^ oligomers, 3' end-labeled with 
digoxigenin-dUTP using terminal transferase. A chemiluminescence detection 
system based on Lumi-Phos 530 as a substrate was used to detect positive 
hybridization signals following the recommendations included in a commercial kit, 
Genius (Boehringer Mannheim Corp., Indianapolis, IN), The membranes were 

35 washed to a final stringency of 0.1 X SSG (1 X SSC = 15 mM sodium chloride, 1,5 
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mM sodium citrate) at 65**C. The blots were than processed for immunological 
detection as described by the manufacturer. Once a clone was confirmed to be 
positive, the corresponding amplification product was then purified using QIAEX 
beads (Qiagen Corp.» Chatsworth, CA) after electrophoresis on TAE gels (40 mM 
5 Tris acetate, pH 8.3, 2 mM EDTA) and directly sequenced using cycle sequendng 
(Delta Taq 2.0 Cycle Sequencing Kit. United States Biochemical Corp.. Cleveland, 
OH). The sequencing reactions were carried out according to the manufacturer's 
instructions with the slight modification that T3 and T7 primers labeled at their 6' end 
with ATP (NEN, Boston. MA) using T4 polynucleotide kinase were used as 

10 sequencing primers. Sequencing products were analyzed by electrophoresis on 6% 
polyacrylamide gets containing 8M urea. The gels were dried and exposed to 
X-OMAT X-ray film (Eastman Kodak. Rochester. NY) for 1-2 days and developed. 
Primers flanking the repeat motif in each insert were selected to minimize hetero- 
and homedimerization; occasionally, the computer program Oligo (National 

15 Biosciences. Plymouth, MN) was used to help in the primer selection. The primers 
were synthesized by the Michigan State University Macromolecular Structure Facility. 

Dog DNA Panel To check the usefulness of microsatellfte markers within 
and across different breeds of dogs, a dog DNA panel was established. The breeds 
to be included in the panel were chosen with consideration given to the diversity in 

20 origin and function of breeds that exist. Table I presents various characteristics of 
the breeds chosen for the dog panel (Alderton, D., The Eyewitness Handbook of 
Dogs (New York: Dorling Kindersley) (1993); American Kennel Club. The Complete 
Dog Book (17th ed. New York: Howell Book House) (1985); Clark, R.D.. Medical and 
Genetic Aspects of Purebred Dogs (Forum Publications, Fairway, KS (1994), 

25 Walkowitz, et aK, Successfuly Dog Breeding (2nd ed., New York, Howel Book 
House) (1994); and Lee, M.P.. The OfTicial Book of the Scottish Terrier {Heptune 
City, TF.H. Publications p. 158) (1994)). Five to ten individual dogs from each 
breed were selected for inclusion in the panel. Pedigrees were investigated to 
ensure that only dogs that had no common ancestors through four generations were 

30 included for independent representation of alleles. Ten. apparently unrelated, mixed 
bred dogs were also sampled. DNA was isolated from peripheral blood as 
previously described (Sambrook, J et al.. Molecular Cloning. A Laboratory Manual 
{2nd ed.. Cold Springs Harbor, Cold Springs Heritor Laboratory Press) (1989)), 
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Analysis of MicrosatelUte VariabHity. Amplification of the correct target was 
verified by comparing the product obtained from genomic DNA to that obtained from 
the reference clone. The variability at each locus was tested by amplification of DNA 
from the dog panel. PGR conditions were 35 cycles of 94*^0. optimal annealing 
5 temperature (50-60**C) and 72*C at 1, 1, and 2 min.. respectively after an initial 
denaturation at 94*C for 4 min. in the standard PCR buffer conditions described 
above. 100 ng of genomic DNA was used as template in each reaction. 10 //I of the 
PCR products were analyzed by vertical electrophoresis using a modification of a 
SDS-PAGE (sodium dodecy) sulfate-polyacrylamide gel electrophoresis) protocol 

10 (Laemmli, U.K.» Nature 227:680-685 (1970)) as described previously (Tas, S., Anal. 
Biochem. 188:33-37 (1992)). An HSI SE500 vertical slab gei electrophoresis system 
(Hoeffer Scientific Instruments, San Francisco, CA) connected to a cooling unit was 
used. The gels were poured between 16 x 16 cm. plates using I mm spacers. 1.5% 
acrylamide stacking gels of 2-3 cm were used on top of 12.5% acrylamide 

15 separating gels with 30:0.8 acrylamide to bis-acrylamide ratio. The gels were run 
at 40 mA through the stacking gel and than at 70 mA thorough the separating gel 
until the bromophenol blue dye reached the end of the plates, for approximately 4 
hours. The amplification products were visualized after silver staining with the Silver 
Staining Kit (Bio-Rad Laboratories. Richmond, CA). This procedure resolved 

20 differences greater than or equal to 4 bps in the size of amplification products in the 
75-250 bp range. 

Results 

Screening 110 plates resulted in the isolation of 1064 independent clones that 
were confirmed to be positive on secondary screening. Using 600 bps as the 
25 average insert size and 500 as the average colony number per plate, it was 
calculated that 1064 positives reflected an estimated incidence of one CA repeat 
clone every 31 kilobases in the dog genome. 

The first 14 CA repeat loci for which primers were designed are presented 
in Table 2 together with the optimal annealing temperatures. 
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The complete set of microsatellite markers is set forth in Table 2A below. 
These markers were identified and the primers designed as described above. 



wo 97/31011 PCT/US97/02396 



12 



Table 2A 



Muter 





Asn sequence 


PCR 

PltMblCt 


Moor 


CTACTCTrOTATTCCATCAAT 


ATTTTCOOCATTCTCACTGOT 


242 




iYlACATAAnCrOTQAOAAOAC 


ATTGAAACrOATAOAOAAOAO 


140 




TACnOAOOCCACACTACTGA 


TCCAAOOOAAOTCATAOAAC 


226 


(GDU 


AGCrrCCAOOTCTOOnTRXXAAO 


TATCCCAQAOCTTAOAGCCTOGCA 


174 


(GDll 


mTTOATOGCTOAATAATA 


oaatpoataaagaagatgtq 


t2 


(CT>14 


CTGCI ItllCCCICIOCCTATQT 


CTAOCACAOCCAATOrrGATTOA 


140 


<OT»l2 


AnntTTCK^CTAACTaACTQAQCC 


TTrCAAAATGGOCrrrCCTTT 


162 


(AQ17 


AOGOTCCCTAACTOACrOAOCC 


TrTCAAAATOGOCriTCCTrT 


162 


(A017 


TfU;:cCCTTAAAOATTTTATl^l' 


CCTOCATCOAACCTGCTTCT 


127 


(CA)10ACT(AC)12 
CA0)4 


TCCTGCATGGACfCCTGCTTCT 


tototattcaoatotgctacttgct 


Itl 


T11A2G(AT4XAT 
3>2(AT2)^Aq>I0 
„/0A)16 


Ck:ACCACrrrGTAACCCTTOAAC 


TCGCATAOOATOATOAATAATA 


ISl 


(CAHTATCA)!} 


ATOOAGCCTACTTCTCCCrC 


OACTTCCTOTATTGGTTACACT 


m 


rroMi 


ATCAOTCCATTCTOATTGGCTATC 


GAAAATGGCAOTTGTACCTOAATCT 


209 


fTOM3frA>4 


ATCACATCCAAATCAAOACTAT 


TGTCCTATGCCTCTCCTATTAT 


in 


(A015 


TOAcrrrACcrrACTTCAccrr 


AOGCCAACTTOGTTACAGATTA 




(CA)3T(AC)2C2(C 
A% 


CAGACCCTTCAGATAACAGTA 


ATTATTci iicccrrrrcTAC 


230 


(OT>9r(rG>4{TA)4 

(Tor? 


CATATCCATCCTCCTAAACTTTC 


AGTGCCTAAAACTAACAOAACTG 


173 


(OT^2AfGT)9 


CCAOGAACnTATOATrCTAAATGT 


GAGCTTGCTTCTCCCTCTOCC 


214 
107 


(TO)l2 


CTTTTCCTATTOTCACAAATO 
AG-rax:ACAtCOGGCTCrCT 


ACAOATGAATOAATACAGTTO 
TGCTOOTil CTCTCn GTOTUTTA I 


169 


fCA)9TAfCA>4 


OTOG AGCCTGC 1 TCTCCCTCTQ 


CnTCCAAOTOCAAACACATAOC 


191 


(Cn)l(A3T)n 


TACCTOAGTCAOTTTACCTAOCA 


07TTCTACAOTCAACCAOATO 


m 


(0D19 


TAAGAOTTATOCCACXTOACC 


CCAOOGAAOAOACCAGTATATGA 


too 


(0TH2 


TAAAOGTCCATTOATAOAQOA 


TGATCCXyVGOAGTTCATTCTT 


105 


(Aqi2 


GAGCCTGCTTCTCCCTCTO 


TGTTTCn CAATOACCTTTCAG 


175 




ATGGGCTCCAAGAATAOCA 


ACCAGAAACTTCATrGTCTCC 


119 


{GAM2 


OAAGTAAATCAACAAACAATCA 


OAAGCAAAAOTATAAGAGCTGTG 


87 


(Aqil 


ATTCTTTCnrATOOCTAOOCAOT 


TOAGTTTCrrcCCTCTTTCTCT 


150 


(GT)6ACTO>5ACrG 


AGACCACTCTGCTCCCTCTT 


TGCCTTGAAATGAACAATOA 


»4 


fGT)l5 


AGGTOTTCTCACTCCTCATA 


CTCCCTCTOCCTGTCTCTCT 


115 


rCA)t0 


CTOAOCAAOACCCATACCACTT 


CCTCCCCACAACAATCTATTTC 


ISO 


(TG)7TArrO>4 


GCATGAGATAAOACACCACCTGTT 


TTCATTTCCTGCCTCCTGTG 


136 


fOT>9 


GACGCTGACAACTGTTTGCTA 


OOAGATAAATCATGAGAACTCA 


284 


(AT>2TCAT)7CA(0 
A><»»(CA)7(GA>2C 
CA^ 


OATnTATTCATTTATCCATCAC 


CTCCCTCTOCCTATGTCTCtO 


107 


(CA)t6rGA)16 


TGOTGAAAGTAACTAAOAACA 


TCCCTCTOCCTATGTCTCTO 


150 


|CAiI6fOA>17 


OTrcrrCCCCAATOGTATTTA 


TTGCATAAOAGOCAOCAAACT 


246 


^CA^A2(CA)3 


TCTOCCTATGTCrCTOCCTOT 


ATAAOATACACOAACCATTACjCC 


109 




CCTOCATOOAGCCTOTTTCTC 


CATTTCTOCAAGACATACTOOTA 


145 




ATOCTGTTOATTACACAOACC 


ATCACTTCCTOOTATTCACAC 


t09 


{OTV19 


TCTO ATiT i CACCCTTAQAAC 


CCAOTTTTCCTGTCTCTCTT 


144 


rro>iofOT)9 


ATGCAAGTTCTAAAACCATACTO 


TAGTGAAGACAGOATTOTGTTG 


137 


frGW9 


ATCAAOTCCCACATC AGC AOCCT 


AOTGOTATOAOOOGCATAAOOAA 


tS9 


fGTrtO 


CAOTAAAOAAAGAGTTTOAACAAT 


AOtTGaAGAAATOAQCACTTA 


146 


roDio 


ATCXCAGGCTCCCrOCATOC 


CrTAAATOTAAOATGTC'; CAGCCnT 


149 


<CT>4CT(ClXOT 


CCAOACCCAATOACATCrCC 


ACCCAGGTOCCCrCrTATCC 


236 




TGGCCTAAACACCTCTO ACAT 


TOCOATACAAAGTAAATGOAAC 


189 


(CA)lf , 


i OACATG ATTACCACATTCATC 


GTACAACTGAACAGACTOACC 


97 


(OT)l6 


1 CTCCCTCTOCCTGTCTCTCT 


TCTTAOTCTTAOCCATrCIOA 


144 


(Cn8(CT>3--<CA) 
12 


CTCACCCAGAGGATGCTTTOAA 


TTAACCTGAOAACATOOCACAA 


190 


fCAM7 


AOOOAGCAGOnTOTOGTTO 


TACTTCTOGTCCAACATTTCC 


110 




OAGTGATTCTCArrCAAAAAQGGA 


TCAAGOGCACnrCTACTOTOTA 


116 


(OT)10 


CTCTGCCTACGTCTCTGCC 


CACCAGTATGCTGATATAATTCT 


142 


(CAMS 


TCTCATTCAAAAAGOOAQATGC 


TTTCAAGOGCACTTTCTACTO 


109 


(OTJIO 



C0D2i7 



jcoiaoj j> 



Emm 
Emm 



coj4a} 



t»:-iH'W 



ICQ2123 



Essm 



JCa260J 
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Table 2A (cont) 



C027ia_ 



CO2t05_ 



CO2M06 



bCTTOOATGCTATTCGCTCAA, 
CCCTOCATAQAGCCIOCI ICl 



n Ar AAfiAACAGCTATOAPAGC . 



TCCCrCCTCCTOTGTCTCr 



CAATGACTTr^AAACTACATTC 
ACJCirrrOCTTATTATATOCTTG 



TCTrCAGTGTAAGATTCAAACX: 



CTACACCTGTGAAACTACCA 



US 



159 



(OT)6(CI>2CA(TG 



(OT)nCAG(A3T4} 
(Cn>A£{TA^>Kr 

iffl 

(TCKTCS(TC>6 



1 CCTACATGOAACCTOCTCTTC 
1 ATCATOGG AOAOGGTGGTAT 



TGTCrTTCCCrCAACAAGATO 
OOQTAOATAAAOACCTQTAAO 



122 



I TTCAGAOTTAATOATGCri AGQ_ 



lACTTGTGTTACCCCri I AACC 



OAGATTCTCTCCCTGTACCAC 
CXTGCCTTrATGGAGTTTACA 



133 



(CA>I< 
fGT)7fOA)n 



rCA>5TA{CA)13 



1 1 CCCTCTOCCTCTGTCTrCTAC 



ATC AATOAAACAAAAGO AACAGT A _ 



(OT)l6 



rCCTG<^TGOAGCCTGCTTCTC. 



CACACCAATTAAACAATAOACATI 



IS5 



CCATTCCCATAGAOAGGAA 



ACCTAGCCAGOACTGOAAAG 



118 



TOAOTATTATGACCTGGAGGGT 



TCAGTAGOTTGTGTCTAGCCT^ 
TCCAGTCACCCTCCAAGATGT 



97 

Its 



(CA}7rA 
CCA>» 



fGDncrro)3 

(AT)11(TA}8(CA)1 



COUOZ 



TCTCAATOATACAAGAACTTCAC 



ATrGGCATCATTCCACTGCTCA 



TGGAGGCAGCTTAAATCTCAACA 



TGATAAOAGTQTOAACAOAGAAG A _ 



CTAGGAGATTGTACAOGTGCT 



95 
273 



GGTCATCnTATACCATCAATTAG 



CTrrAATGCTGGCAGATGCTAT 



CAATTCTCrCTATOCCTCTTTOT 



TcrrcrrcATTCACAOCCAATCT 



I7JI 



(ACn6 
(GA>-20 



(CAIIO 



(CT)4T(CT>20T(C 
TllOfCAn« 
(GTOI 



TCGGAGATGOAAACrrrrOTAAOAG 



TCTAGTGGACTGTTCTGAATrTG 



t06_ 
161 



CQ31507 



ATCTCGTAATITCCCATAATACTTA 



ATCAAOTCCCACATCACACTCC 



TACra^AATOGCAACAOTITA 



CCTTAGACCATCTACCTCTTTTC 



CATTCTGCTCATCTCCATAAO 



GGCACAACTAACTCATTTCTAT 



J«8 



CCTOCATGGAGCCTGCTTCTC 



TCGCTATTTATGCAGCATCTCTT 



|C093U 

jca3»3 



OAOCCTOCTTCTCCCTCrO 
i AGCCiGCTTCTCCCTCTOTC 



OAOACCATAATTCACAATrCrrC 



156 



TOTTOCrTACCCTTCTOTTAOA 



151 



i AGTTtX!ATCCACATCGrfCCA _ 



AGAAAOAGCCTAGATGCCCAT 



141 



I TGC i f C r C CCTCTGCCTGT 



GGCTGTAAGACXiCAGATnrCT 



134 



{ ACATTGGGTTCCTGC ATGGAG 



GGCAGTTTGGTGATGTCTATCAA _ 



237 



OTGCATOOAGCCTOCTTCT 



ACCTTACCACXCTCCATOOA 



161 



tTAGTGCTCATGGAGCCfTTCA 



TATGCTGATTCCACCTACCTC 



83 



(CA>2(CA)5TA(CA 
>6(OA>6 
<CA)5GfCA)17 



(CA)13 



Crc>l2ATGA2T(A 
?>T3 An 



{CTO^GT>t(KCn2 



(TO)! 9 

(CT)6 (TA3)2n' 



A3ynA4yrA3>< 



(GD!3 



I TCAAATCAACTCGTOTTTCTGT _ 



GGATCTGATAATCCACnTAGA 



I OAAOGGACAQ AGAAAGAAATGAC 



TGTAAGGCCTGTTACCTCTAATC 



333 



i GOGTCTCCTTATCACACTO_ 



AGCAACACrrGACATTATTT 



135 



S ACCAAATOAGCCACrrAOar 



CCTCTGCCynTCCTCTATG 



109 



i AATGCTGTGOAAGGTOAATOATA 



ATOOAGCCTGCrrCTCCCTCTG 



224 



I TCAGCAACTATACATTTAAOAGCA_ 



CTOTCCCATCTAAAGOATAGO 



160 



I atcoagtcccacatccito^ 



CATTrACTGGTTTGTCAGTTAGO 



120 



ITGGOAOATOAAAAOTA'TOTC 



CCTGTGCCTCAAOATAOATO 



250 



I OAGTTCCTTCTTCCGC ATCTAG 



ACTATTCAGAAAOCAOTACAACCT 



120 



\ A-TCCTAOTrAGGCATGTdCTT 



OGTAAATrACAOCAOGTGAT 



205 



I TGGTTATTACrO AGC AG ACATC 



G C YVV \ U 1 rrCCTTCAAATAC 



168 



I AG AACCrATCCAGCrATT ATAGTG 



CTCTCAOATATGACCAACXTA 



2f4 



I ATATACrrrCACTCTCCATCCAA 



AGAAOAGOAGTCTTTOGATG 



139 



I CAOTTGCrAAGAGGTAGGTC 



GTAAATGATTACCATAATAAGOT 



114 



C047i6 



TTCrCCCTCTGCCrATGTCT 



AGCACCCTCGTACTOTTrCT 



TTACCAAGCTAAOCCTOGCA 



TGGAACCATCACTOAAGOGA 



AOACCACCGAATGGATOOACT 



TGGAGTAAGTAGCAATCCrCT 



CITTqOTCTCTGOTGGCAATAG 



i C04806 I GCCTCACTCATCATCATTTTC 



TOGACTTGTGATACACCOCACT 
CAACAACAGATTCATATTTGCTATCA 



133 



144 



frG)> 



frCU3<AC)l2 



<CA)12 



(CAWOAVt 



tfGT)6GArGr)H 



fGT>6A2(GDl4 



<OA)2(AC)ll 



(GTOl 



flSiUL 



OX?)lt 



(CA)I3 



ATC)ACrA3)2(TAt 



207 



180 



fAOl7_ 



fCA)17 



(Td)l8 



ICO4903 lACTGCAAATAACCTOTAGAGTOCT 



ACCAATCACCATTCCCTCATTC 



15? 



(A016 



AAGACTTCACCACTCACAOTCA 



CTGGCTCAGTOTOTATOAATQ 



143 



CTCTTAACCQACCTTGACACC 



AGAACTTGCTTATOAAGTCATOT 



20S 



ffCA>6T(CA)ll_ 



C05I02 |AAGCrGTOATOTGGCTCTCAAC_ 



CAATGGGCAGAAACAATGAGQA 



17J 



(AOl5^ 



(AC)20 



C05JQ3 i ATTGGCATTTATCTTC ATTGT 



AAGAGGAAAOAATCTGTOAACT 



196 



CO51I0 tTGGAGCCTGCi rnCCCTCT 



ACCCTGAOACCATGAGCTAAO 



1«5 



(GT)l6T(CNr>lACr 
SI 
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Table 2A (cont) 



I nrrACTAACTaSTrCKSATTTCATC 



I CTOCTTG AACACTQCCATC 
loAClCCTGCTTCTCXXTCT 



1005205 



lATCACOACXTTOAACCTAAO 



1 tOACCniXXSOAAGCTGOACL 



injuiccTQCTrcrrcccrcm 



I ATnAAAQTQACACATCATATT 



I TATTOCATCCTGCTTCCAO A 



I AC AATAGCCTAQ ATATGO AACCA_ 



i TOAAGTAOTAGCCrAACn^ACA . 



i cogtcx:atooaocctocttc 



t ocmjCAToo AGccTocrTcr, 



iC05405 



ICTOTATOOAOCCTGCTTCrC 
IcrAAACCACTDAGCCACCTQ 



I CAQGOATCrrOCTTTTAOCAT ^ 



I ATrATTACTGOTOCCTTATTTAG A 



} CCKyrcaCATOO AGCCTOCfTCT 



I TTTCAOTCCAOCCAAATG AAC 



I GAGTCCCACATCAGCCTCC 



{ OCCACCCAGGG ATCTT AAAT 



{ TACC ACTCTOCrrGG ACAt 



\ GTCCACTTCCAATTGCCGTT 



i AATCTCTCAAATCTCCTCCAT 



CftS506 JCACATOOOCCAATTCCTATAA 



fCftS506 I 



tTGrrCGOTAGCATAOCAtAOAA 



tCTGCTTAOAOTOCTGTACCAC 



I TCTAGAGO ATCACATGCAA 



t CAOATOTTCAOAATGATI J AATAG 



i TATAGTAOGATTCTTOTOOTrO 



I AATAATOAAAACAGCCAACTT 



I AATAATOAAAACAGCCAACTT 



I CTCCCTCTGCCTOTCTCtCTG 



{ TCTCCTTCTOCTACTTCTCC 



I GGCTGCCCCTCAC ACATATT 



I CTGATATAGGTAAGTTGCATnTG 



I ACTCTCTTCCTGOCTTGTAO ATG 



I ATC AACXIACACGCTCCTTCTT 



t OAAATOAACTTATGAMn" \'J \ O 



I ACCAAATGTCAATCAATAO ATG AA 



CM504 



CCTGAATAOAOCCTOCrrCTCC 



f CCATG AATOTrOAGTGTCTCATA 



I ATAOTG AAATTOCCCr AGTGGT 



iTOTrGCTCTCTCTGCCTAAT 



< ATOerrAO ATGTAGACCCtTAG 



iTCCrCCTTAOOOACTQTACCC 



tCTCAOAATCAGCAGCAGOTGCC 



\ CAOAAACTG AGATGTOTC AAAAGTCC 



I TTCTOOATOAACA I AeC f 1 > O 



I OAGccTGcrrcToccfrcrcc 



\ AanTOAACATCCrrAAATrO AT 



i CTAGTTCCATCCACATCATrG 



IC07DJ1 



jITCICCCTCIGOCTOTOTCT 



t OAAGO AAG^ACCAGTAAAOT 



I AGTCAC AOAOGOCAGTOTOO 



TCTAATCTCCATTCAACACAAOTOA 



iTGCATACAGTAtCAATTTOTOA 



i ACTATATTQACAAGTATCCACAAGA 



\ GATAOATOAATGGATAAAGAAA 



i ATCACTAAACCACCACCAOAO 



onCACCAAGTCmTCATOTAAT 



GGCATGOAGCCTGCrrCTC 
TACCTGTCACCATACATAGT 



ccTGcrrcrcccTCTOccT 



CCATCAGTGGTOrrATCrOTA 



CCAGGAmrOOAAOOTTCr 



TQAAAOOACOCTOAATTQO 



CAOCCAamOGCCCTTCT 



OCTGCAAATAOCAAGAATTCAT 



TAATCCTAATCCACTCTAATOGT 



ctoaacx:atccaoatgtccaoa 



CACCTACCTCCCCTTCTGCAA 



ccttgaaggatattgtotcx: 



ATGTOTAACAGAAGCCACTAA 



ATTGATGrrrrCrCAOATTC 
TCATGGGTCTAAgrOTTTOOA 



GGGAOATAGACAATCACCAAAT . 



rrGCGATOOAGCCTOCTrCT 



GCTGTrrACACAAAACATAOAAG 



CCATTACCTCACATGGrrACn' 



ACTAATrCCAATGTACTOTTAC 



AAGTACAGG AATTCTGTTATGAO 



CTCTGATTCCTCTAGrrTCnrCCT 



GTATTGGTCAGGATTCTCCAO 



CCTCAGmTACATGAACTCA 



CTCAGCTCCTGOACACITCCT 



CTrCTOOACTCCTGCCTTCC 



ACCTOATATOTGGCATqrrGT 



ATCOAGTCTCACATCGGOCTC 



ATCATAATGATTOAATGAOAT 



rTATTTAACCCACTGAOCTACC 



OOGCTCTTCCTTTGTATCnT 



TAGTGOTGGGGTTOAAAOAO 



ATAACATCTGOATTGGGTCTA 



CTGGAGCCTnTAAGGTCATT 



TAGCACTCirCCCTTCCCCTTA 



TTGGGGOAOTAGCTTCATrrCTG 



AOGGATTAGTGAOTTGnTACC 



CTAGACCCATCCATGrrOTTG 



TGnTATrGCCCATITOGAAA 



GAGCATGCTTCTCCCTCTG 



TATCATACTOCCCATTATOrO 



CTTTCAATCTGTTGOTGTCTAT 



TGTCATCCAGOCAATAGAACT 



OCATCACAOACCTGTCAGOAAC 



OTTGCTAAGrrACAOACATCACCA 



ATGCCATO n C t G ATGCTCTTG 



TGGTCAGGGOTAOAAGAaTO 



OTATTAATGOATGOATTOCA 



AATGCAOAATCCAAGAAATAOAO 



ACAGTCCAAOTGTCCATCAAC 



GTATCrnTATACCrrOOACCTAT 



TTCTTAGAAAOACCCOAGTA 



ACATCCOCTrrAATTTOTrTC 



CGGATATAAAGOTOGGGTATT 



GGATAOAGTCCXrACATOGG 



QAGCCTGCCTTTCOCTCTG 



TTAGCATAACACTCTCAAGTT 



AGGTAAAAGGCGAAAAOAACTT 



13g 
167 



164 
224 



151 
178 



132 
105 



i4> 

300 



152 
141 



in 



263 



2«0 
101 



231 
1S3 



150 
73 



163 
234 



243 
136 



2L- 

168 



IPS 
227 



194 
9S 



131 



140 



138 



105 



177 
167 



128 
143 



131 



214 



186 



114 



161 



137 



131 ^ 
206 



166 



81 



1S6 



US 



138 



215 



138 



118 



187 



168 



183 



135 



129 



fCA12CG{CA>9 



(CA)18 



fCTV2<OT>l4 
(AT3>I0 (A1>3. 



<GAnG(dT>14 

(cyni3 



(A018 
fGT>14_ 



(TA>3(CA)13 



(A015 



JSQI3L 



^cTgrroio , 
fcm3(GT>i3((rn2 



(GA)2(CA)6TO(CA 



<TGOTArrG>7 
(CA>8 



rCT>15fGT>7(CT>2 



{CA>8 



(GDll 



(Aa7 



fAC>9 
fCA)20fAC>8 



nrG)iiT3(GrM 



(CT)n(AC)7C(CA> 
10 



(CA)14 



fAC)19T<CA>4 
(TQ)t5 



fAT>4(GT)7 



(cr>i2 



fGT>12 



(GT^14 



fA3T>H 

/CA)IOTArCA>8 



<GT)M 



(GT>15 



nsnjL 



(CA)11 
(CA)16 



<CT)6(GT)7AT(GT 
>2{CT>2CATO(AaT 



(CA>S(0A)13 



(CA)n 



fCn9fC:A)10 



rcA>to 



fGDl2 



(GTU5 



«mi2 



(Cl)6(GT)15(AaT» 
2L 



(Gnu 



fOT)i7 



^GT^I0fGA>9 



(CATflO 



<OT)ll 
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Table 2A (cont) 



CAGirACATATACCATTAOCCA 



ACATTGGC}CCTAATnAAT^ AT _ 
TOCCATCTrCTOATGCrC»-lO_. 



tTCArrCATCAAaTCCnrAOTTAT 



1 TTCACAGC AOGO AAACTOTTATG ^ 



1 AACTtnXlTACTTCClOl iCAfl 



iCATCAcarrcAOACTorrAcrrGTr 



I O ACTO ATOGTOO AGGOTO AO 



I'ClTOOAAATGTAATOTgTOTA 



lATmrCTTAQCCAOTCATAAATC 



rTCATCTACrrCCTOnrOTAOCC 



^^^GCAOATOCTCAACCAC^TIL 



lACrrAACTAOAGAGCGTGTGACr 



lATATACmCACTCTCCATGCAA 



I CACAO AATACTGOAACTCATTTAO 



fAACTGACATCAACAGTCtOATAC 



ICTACATOOAOCCTGCntrrC 



i CCTGCATGOAOCCI^iC ijyuL- 



I TGGAGCAAGCACTTTCf ATAAAC 



I <XTCAAATAATGOAAGTO0CT_ 



i GGTGGCTCAGTGGTrTAGCA^ 



t tCACCTACTG AG ATACTTCCAT 



I IrrCACCrcrO ATTAT AfCTAGG 



{ CCAGG AATAOACAATGCOCA 



I CTCTGCATAATGCCTOCr 



i CCTACTGTTTTCTGTATTOGCA 



I TCTGTCAATGG ATAAOTOOAT 



I CT AG ATTCATCC ACCTCACTG 



I TTGCCATCACTO ATACAACT 



rTTACCCTTCGCTATCTATCTAT 



iTGGAGCCAGTTTCTCCCrCTO 



I TAGAGC ATCAGGTCCCACOTC 



i AACTGGAAAAATOGATAAfCAO 



tCtCACTAAATTCACGTTATTOA 



I AGATTTOAACCAGGAAATTAGOAA 



I TGTTAATCTTCCTAATCtrCCAG 



ITGGAGCCTGCTTCTCCCTCTG 



tTGGAGCCTGCTTCTCCCTCT 



I GCrGGrrCTTTCTCTATTTATAC 



t ACrOCTGGTTCl TI C J CI ATTT 



I CTAGCrTOCTCCAClOAOnCC 



t GCrrCAGGAATCTAGGOACAA 



i cmSCCroCTtCTCOCTOlCTC 



I GTACTCTCCncTCCCTCTC^ 



I CCCAAATTAACCTATATITAOAAC. 
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179 


fOA^U 




0 AGO ATCAAOTOCCATATTO 


GTAOOCAOCGTACAOATOOAT 


135 


new 




COCTCATOCAAOTCATCACAT 


ACACrCTGOTOCAAOCOACrC 


125 


fcnt5 




CATnCTCAGCATOTATTATAO AT 


GTCGOGCTCCCTOCATAOO 


120 


fOAM4 




TACrCTOG ATOCACTCATAAOO 


TOOCTTAAAACCTACTCCTCAO 


123 


fCT^M ^ 




CTTTGCXrAOTTATOCXjTCrOTO 


TGCCTOTOTCTATOTCTOOCA 


132 


rGA)l6 




OCCTACrrCrCCCTCTGCCTATG 


CTGCrrCTCCCTCrOCCTTTix: 


163 


rCAWOAMO 




anxKnrcTccrTcrGCCTGT 


TTAOnTTCACCAACTOTAOOG 


154 




UliliiM 


CTOCATOG AGCCTOCTTCTCT 


OAATTOTAAGrrrrCAACTOCC 


144 


(CT(9aCT>4 




ATCACAACCTAACC AAAACGCT 


CTCTCCCTCTGTCrOCCACrCC 


142 


{OA)12 
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Table 2A (conL) 



T ACCCAGGTGTCCrrCAAAATOT 



lATtK3ACKxrrcxrrKrrccciCT 



InfytAcm-rAOTCTAGrrCTCrO 



rrrAAATTCACX>ATrrCTOACTCC 



t bcAGCCAATATO ACATCATCC 



ITOCATOOAOOCIUCI ICIU3C 



1 tAOCTCm>AOCTOCKITOTQOA 



GCTCrOCTCTOCCTOTCTCr 



TCAQCTATAAATTCAACTOGCrrA 



TAATCAOQTTCTTGGAOATOAC 



ATOOAOCCrOCI ICTCCCtCT 



TACATOOAOCCIliCi JCICXX! 



OAACAOCmTOCAGCAOCC 

hQCAcrrcAcrTAA 1 ci iTaAaT 



2 06 . 



151 



I3P 
161 



161 
175 



H4 
174 



(OA>9 



<cni4<crn(CT}i 

<OA)gAT(GA> 



COA)8 



O02fl3 



cxa«i4 

O02gl5 
OQ28I7 



OOACOATCOAATCCCACGTC 



TCATTTOTCACTrATrAOrCCAC 



tocrocrnrATAGTAAAAATO 

TCCrroCTOAATATOACCnTCA 



CCI GC TTCrCCCTCTCCCrAT 
AAOGOAOGGQAAACQACACAT 



065 
154 



{CT>2CCCCD»(CT 

£ 

<CA>5^0A>U 



ATCGAATCCCACATCAOOCTC 



CACAAATOTAAACTGOOTATATT 
CACCAGOTTOQAAATQAATAAG 



m 

154 



O02>»9 



ACACTCA<X:ATAOAGTCTOCTrO 



002831 



CCTGCACAOAGCCTGCTTCTC 



aaacx:actgagccacccooact 



153 
341 



OQ2902 



OATTOAOTCCCACATCAOOCT 



AGCTOTGnTATOACTACACATQ 
CCAATrrGAAGOATTCATCATT 



(cr2TO(cr)6 c 

CT)6 

(CT>2GT(Cr>70T 



002903 



TAGAGCClXXnTCTCCCTCTO 



146 



tTCCATCTOCCTATCACACCACT 



TGAGCACTGOATGTTATATOCAA 



t ATCT AATCCCAC ATTOGGCTC 



ATQQGOAOTCATCAGACCAGO 



199 
171 



I TAGOCrrCCTGCTCAAG ACAO 



GOATOOAGOAOAGGCTTOTTA 



I CTGCTCrCTTTTCOCTCACTC _ 



rrCTCCCTCTGCCTGTOTCT 



209 
141 



lACTOAOATOOOAAOOOGCAOA 



CTACATCGOOCrCTATOCTC 



toAoarrGcmrrcccTCroc 



AGTCCTGTGATrAOrrCTCAOAC 



83 

1C6 



I TCXrrCCC AACATTCTACAATOAA 
I TGCTTCTCCCTCTGCCTCTGT ~ 



134 



CCrrCTGGATCTGCmTACTAT 



203 



\ QC ACTCAOATGTCCCTAf ACTAT 



AAACAGGATCGAGTCCCACA 



tTAGCAGACAAACCCCAACTO 



OAGCCTGCnTCTCCCTCTO 



212 
167 



i CTGCATGGAGCCTUCnCl i 



TtnTATTCAAATCCTCCTOATTAT 



i CCTGCATGOAO ACTGCITCT 



TGTTTCCTCACTTCTTACTDA 



153 
2» 



I GACACCApCTTOATTAroATT 



TGGAQACCTGGOATrOAGTC 



I ATCACACCCTGGGCTOAAOO 



TGQAOCCTGCrrCTCCCTCTO 



166 
174 



i AGGATGCCCAGTTACATTTOAA 



TGATGTTTGATGTTCACOTTOAT 



20S 



t C ACnmrrATOTCCTCTTATr AAG 



CAGGAGTCTACrrrrTCCTTCTG 



170 



I CACTAAACCACTG AACAACCT_ 



GTCCCACGTCAGGCTCTCTQ 



15S 



ICACTAAACCACTGAACTACCT 



GTCCCACGTCAGGCTCTCTO_ 



15S 



\ TGC ATGGGGCCTGCTTCTCL 



CCTCTTCATACrrCAGCAAGTG 



169 



IG06202 



1006221 



I cCCTTCTCTOTCnrOAGAGT 



AGCCTGCTTCTCCCTCTOCC 



144 



icrrcTCccTCfGACTGrrGrrcT 



TCCCTCAAAATTCAACATACAA 



168 



jccTGcrrcrcccTccTCCTO 



TCCACAAAGCTCCCTACTCAT 



163 



\ C ACTGGGCGTGTAACCTOCT 



CTOAAATGTAAGTGCAAAGOAA 



172 



I CTAATATCAAAAGGTfATCCAC 



catcttcxtotxkx:aototc 



267 



I CO ATAACCAGG ATAATTTCCTAC 



AGAOAOGCCCACATCAGOCT 



156 



(TC>9 



£1211. 



fOAT>6_frC)9 



(OA)* 



fcpio 



fOA)3CAfGA)9 



(CT)13 



fQA113 



fOAMO 



fOA)i4 



fOA>U 



(OA>9 



(GT>9 



(GA3X:(AG>90(0 
A>S 



fCniUOA>3(CT>2 



fCmi2.,(A3C>8 



fOA>n 

CAT4XAT3p(OA)i 
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O06706 


ATCOAOTCCCACOTCAOOCT 


TTATrrAnTATTCATACAGATOCA 


9S 


(CT)13^TO(A2T 
XA3T)2(A5T) 


006707 


GGTGCATOOAOCC 1 UCl 1 CT 


tOCCAGTTCAOrnTCAAAGTT 
AACCCGGGATTGAGTOCTO 


147 
167 


(CnniGTO 


006730 
C067X3 


TTCCnUl i iCTATrCTCCTC 
GAGATCOAOTCCCATOTCAO 


crrroAGOAQATAAAicmurA 


125 


(OA)l4 _ 

0020 


O0ff7J4 


ATCAAATCCCACATCOOOCIC 


ATtAOTTCAAACXrrCOOCAATO 


163 


frc)i2 , 


G0^15 


TTGATCOAGTCCTACATCOO 


TCTTOOGTAAACTACTTAACIT 


174 


^11 


006717 






147 




OQgtQi 


AGOOAOOTGCI JCHXI ICTO 


CAATGATTATOOTITOTCAACTr 


162 


fCT>l7 


CX>6>05 


OACACCCAACOCXrrOAOCAC 


GAOOcrGC-rrcTcccrcTOOc 


16S 


rCA)3rOA)9 


006902 


OOCAGCrrrOATaACTOATTTOA 


AGTCCTCTGTCAGOCTCCCT 


211 


fOA>l7 


006908 


OOAACACGTrAATTCATAAAAATOAT 


AtOOAOTCCCACCrrCAOGCTAC 


203 


rOA^CAfGA)]0 


OQ6909 


OOCAOCACTAAACCACTOAO 


TGCTTTGCATCTTCCATTTT 


209 


(OAHS 


G06910 


CTCTGCTCAOCGOOOAOTCT 


TrATCrrAOAQTGATGOAGAGTOO 


iOl 


(cnis 


006914 


GOAAOAT0rrGTCTCtTATCA 


GGGTAOGGOl 1 1 iUi i lATOG 


159 


(OAUO 


GOTOOl 


CTACATOGAOCCrGCTTCrcC 


TCCCCACAACnTATGTCCTC 


129 


fcniifOTv 


007002 


CCTTCrCCCTCTQCCTOTO 


OCCACTGATrrATTCTCTOTA 


197 


(cnu 


007004 


TocrrocTCTcrcrrcAAATAA 


OTOCATOGAOCCTOCTTCT 


UI 


(OA)10C^GAVS 




CCCTCTGCCraTGTATOTOTC 


ATOOCAOCAOOOAGTAOTCCA 


134 


fcni2 




CAGTCGOOAATCrOCnTOAO 


CATTTCACTACATATACAGOTCTCA 


150 


«mn — .fCDio 


rr'.W.IiVJi 


AATACCTGGCTAAACATTTA 


ggoatcoagtcccatgtc 


156 


roA>n 




GTGCATOOAOCCTGCmCTC 


AATOTACCrcrrCCCCTTTTO 


127 


«mi3 




OCATTCACCCAATAGTCCrrO 


octgcttctccctctgcctac 


135 


raA>i3 


|C0730S 


TCTCAATTTQAAAAGriTAl Ati TC 

. 


TTCTCCCTCrCCCCCTATC 


174 


(TA3)9 — CGA)5— 
rOAl!7 


Kwm 


TATOCTTCrCOCTCTTCTCTO 


GGTrrCTCTCCrTGATTrGTAAG 

tgctaaactcaactctcctaa 


159 
123 




rcmi4 




CTICTCCCTCTOCCTOTOTC 
CCATCAGTrTGTTCrrCTATCA 


GAAOCCTAAGTGAOGAOTAa 


224 


fcnii 




OGAOCCTCCf '1 C 1 CCCTCTG 


tatcgtgccacactgctoaat 


244 


fCI"HlT2(Ci')3 




AATTTAOTCOAAOAATOAAAOATG 


OAAATAOCCTTAAAAOCAATOTA 


221 


fOA)14 






TGOAGCCTGC I ' 1 C I CCCTCTO 


134 


(OA^tO 




TGTCACTCTCCTCTCCACCTO 


AGTGCCTAAAGTrCTTCCTATrO 


135 


fcnu 




A\CliXr['kCTGCACTCCtOCT 


CACGTAAGGGATOAGTTCAOGT 


147 


rrc)8rroi2 








231 


(OA)24 




TCCCTQAAAGGGGCATrTAAOACC 


AOCCTGCTTCTCCCTCTGCCTATG 


m 


fOAMl 




TCAGOAGGTOAOTTGCTTCOAO 


CGGTOCATGGAGCCTGCTTCr 


162 


fCA)3fOA>l6 




CTCCCTCTGCCTATOTCTCTO 


ACAGCCCTGnTACCOAGCTO 


253 


(CT^u 




CAGCAAACTGCTOOACTTOTOCr 


TCCTTCTCCCTCTGCCTOTGT 


S26 


(GAn5 




AOTrCTOOAGOCTGGOAAOTC 


OGTGTGAAATOGCTCTTTAOATA 


215 






7GCATGOAGCCTOCTTCTC 


AOCAGGTTACTCTTACTOACTCC 


138 


fCDU 




ACTTCTCCCrrCTGCCrOTGT 


TrcCAGTCTATOTTOATTOAA 


124 






ATGGAGCCrCCI ICJCCCTCT 


o r 1 1 CCTGCTCTCCTAOCTGO 


163 


P5J9 




AGCCCTGCI 1 iCiCCCCTCr 


OATnTGATrrACATTCACAAGTACA 
TCCCACATCAGGCTTGCTOTAT 


98 

152 


(TCloiO 
fOA)9 


1007701 j 


AGGCATCCCrTACTTACTTACTTG 
TATrCAAGCCATrGACOOATTG 


CATGGAOCCTOCTTCTCCCTC 


247 


crG)2(rc>2Gcccr 

OI6 




CTGCn C rCCCTCTOCCTATO 


TTTCCAACArrATOCTATOAT 


198 


rcni4 




AGCCTOCCTCTCCCrrCTCCA 


AGAGTCACAATOCAACCCCACAA 


246 






GOTOACACTATACTOAACCrrcr 




116 
302 




CTCOCICTGCCTOTGICTCTO 
CATrrcOCTCATGTGCCTOACTGA 


AAI 1 1 f TATOTaTCCTOOrrCAGCC 
CATGOAGCCrrOCTrCTCXCTCTCC 


147 


(CTP 
fOAM6 




GCITCTCCCTCTOCCTCTATCTCT 


ATTOATCCCXXJATnTOOTAATA 


175 
149 






TAtmcrr i CTGCccnrcrcc 
crocATGOAOccrcci icrc 


CATTTCCAATCCATrrAOAOA 
TCAOACGCrCAACCAACTOAG 


179 


fCTO 


1 0077I3 i CTTGAyiGGCXKXn^UI ICHO 




234 


rOA)16 




CAGCAtGOAGTCTOCTTOTC 


AOCTAAACATTTAACCAACTOAO 


219 


fCT>14 




UGG 1 AO AACI'G ACAl-ICrn 


CTGTAOOGAGCCTGCrFCTC 


133 


fGA^7CArGA^ 


|gOS002 I 


CCTATGCTCCTGOAOAOCTG 


CTAAT710ACOAGATA0GATACATAAT 
A 


153 


CCDI7 




GTCAGCTTAOCCATrOAAGAAT 


CCTGCTTCrCCCTCTOCCrC 


174 


(CA)2(OA>15 




GTCAGCTTAGCCATTGAAOAAT 


CCTGC I 'I C I CCCTCTCCCTC 


174 


fCA\2fOAM5 




GGCACAACACTCTOAATTATrAG 


CACTCATrrATTOCCXnrACTTnA 


173 


fOAUS 


|C0S003 I 


CCTCTTCACTGCAAGGO.\ACr 


CATCAGATACrcCAACAXrCAO 


190 


(aA)20 




CAOAGTATCCrrGCCTOTAG 


GTGCCTGGAGCCTGCTICT 


139 


(CA\i(OAm 




TOGTACTGTAGCmOAAOAT 


TCTGTGAAAOACACCCTATTTA 


173 
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Table 2A (cont) 



IH035O1 iTTGCCTTCTOGGTGTATTGACTt 
■ H03502 (OATCCTGATrOTTCTTOAO 
fiffiSoij TOCTTCTCCCICI'OCCTOT' 


1 OAATCrrOGTTAGTAOAATTATACAO 
GOCATGOAOCATACTTCA 
TGCTOAAAQATrACCXTGTGGA 


300 
U5 
125 


CAT3)1(HAT2»AT 

(AT1>5 (AT4XA 

13)2 


Iho«603 j AAOTCOCACO'IX^^GOCTC 


ACGTCACCACAACCATCTAA 


165 


(AT3)12 


IhO»205 j CATTTOCTOAGTCAAGOAATTCT 


AOTrACCTOOAACTTOTCAGAA 


200 


(AT3)12 


IHOSSOi iTGCATCaAGCCTGCITCr 

IH09308 j aotccack:atcaccgtttot 

HlOtOl j TCAGGCTCATOGGATTGAOACrrc 
! K101D3 1 TCCACACTCAGTGCAGAATCTGCTT 


CTTCTACACATGTTQTCCCT 

OAGOCTTATnTCTGTCCAOTT 

TOCCATTOCACAGGATATAGGTCCA 

TGTQAGACCOCAGAATACAGTACTC 


leo 

144 

305 
141 


(AT6XAT4)2(AT3) 
!3 

fAT3>9rAT4> 

fAT3)n 

(AT3)U 
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Amplification reactions were carried out under standard PCR conditions 
described above using the annealing temperature indicated for each locus or a 
touchdown PCR protocol (Don, R.H. et a!., Nucleic Acids Res. 19:4008 (1991)) was 
established. The variability of these loci were evaluated using the dog panel. For 
5 each locus, 5*10 dogs were studied in each breed. The number of alleles observed 
are presented in Tables 3A and 3B. 

Table 3A 



Marker 
{ Locus 


Mixed 
Breed 


Cocker 
Spaniel 


Labrador 
Retriever 


German 
Shepherd 


1 

1 _ 1 

Beagle 


D00101 


3 


2 


2 


2 


3 


D00401 


5 


4 


3 


6 


4 


D01205 


4 


2 


4 


4 


4 


DDI 902 


6 


4 


6 


3 


4 


D02001 


4 


3 


3 


2 


4 


D02006 


3 


3 


3 


3 


3 


D02011 


3 


1 


3 


3 


2 


D02012 


6 


4 


3 


3 


4 


D02202 


4 


1 


2 


3 


4 


D03709 


5 


4 


3 


4 


2 


D03805 


6 


4 


4 


3 


3 


D03908 


4 


4 


3 


5 


4 


D04403 


2 


3 


1 


1 


3 


D04702 


3 


1 


3 


2 


3 



WO97/31011 



PCT/US97W2396 



-23 - 
TabI 3B 



Marker 
Locus 


Doberman 
Pinscher 


Siberian 
Huslcy 


Scottish 
Terrier 


English 
Pointer 


Greyhound 


D00101 


3 


2 


2 


3 


2 


D00401 


3 


6 


5 


5 


5 


D01205 


2 


2 


1 


3 


3 


D01902 


5 


3 


4 


4 


7 


D02001 


2 


4 


3 


2 


3 


D02005 


1 


3 


2 


3 


3 


D02011 


2 


3 


4 


5 


2 


D02012 


3 


3 


4 


4 


3 


D02202 


1 


3 


2 


2 


1 


1 D03709 


4 


6 


4 


5 


4 


1 003805 


3 


7 


4 


5 


4 


1 003908 


3 


8 


3 


4 


4 


1 D04403 


1 


3 


2 


3 


3 


1 D04702 


2 


3 


2 


3 


2 



In genera), a)) of the mtcrosatellite loci tested displayed variability within and 
across breeds. While 9 cells out of 140 (6.4%) in Tables 3A and 3B were 
monomorphic, these were scattered though 6 different microsatellite loci, which were 
quite polymorphic in other breeds. The maximum number of alleles detectable by 
this analysis for a locus in a given breed was 8, in the case of locus D3908 in the 
Siberian Husky, The percent heterozygosity obsen/ed at each locus in each breed 
is presented in Tables 4A and 4B. 
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TabI 4A 



II Locus 


■its _« 

Mixed 
Breed 


Cocker 
Spaniel 


Labrador 
Retriever 


Geiman 
onepnerQ 


Beagle 


D00101 


20 


0 


n 


0 


90 
25 


D00401 


100 


100 


inn 


88 j 


1 D01205 


70 


50 


n 

u 


22 1 


64 


1 D01902 


100 


100 


lUU 


11 I 


36 


1 D02001 


40 


86 


Of 


50 1 


33 


1 D02005 


90 






22 1 


27 


j D02011 


38 


0 




44 f 


18 


1 D02012 


0 


17 


33 


0 1 


33 


1 D02202 


20 


0 


0 


0 1 


0 I 


1 D03709 


20 


100 


75 


89 1 


50 1 


II D03805 


100 


50 


50 


30 1 


67 1 


D03908 


100 


100 


100 


88 1 


100 il 


D04403 


100 


100 


100 


100 j 100 il 


D04702 


22 


0 


80 


0 


30 i 
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Table 4B 



I Locus 


Pinscher 


Siberian 
i^uslty 


Scottisli 
Terrier 


Englisii 
Pointer 


Greyhound 


Fdooioi 


60 


0 


78 


86 


38 


D00401 


33 


50 


86 


67 


100 


D01205 


60 


44 


0 


86 


25 


001902 


100 


63 


100 


100 


100 


D02001 


100 


57 


25 


50 


13 


D02005 


0 


50 


77 


71 


100 


D02011 


20 


33 


44 


43 


50 


D02012 


0 


50 


17 


40 


0 


D02202 


0 


0 


17 


17 


0 


D03709 


100 


78 


100 


86 


100 


D03805 


100 


67 


100 


80 


29 


D03908 


33 


44 


100 


100 


100 


D04403 


30 


50 


56 


14 


29 


D04702 


67 


20 


33 


60 


40 
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No heterozygotes were observed in only 21 out of 140 (15%) of the loci/breed 
combinations studied. At the same time, 30 out of 140 (21%) cells showed 100% 
heterozygosity. The mean and standard deviation of heterozygosity observed for 
each locus across different breeds, as well as the mean and standard deviation of 
5 heterozygosity observed within each breed across different loci are shown in Figures 
1A and IB. respectively. The breeds studied show a mean heterozygosity ranging 
from 36 to 60% across different microsateliite loci with considerable standard 
deviations. Among the loci studied D03908, D01902, D03709 and D00401 showed 
the highest mean heterozygosity across breeds of 87, 81, 80 and 75%, respectively. 

10 The number of repeats in the reference done in these loci were 16.18,12 and 22. 
The least informative loci across breeds were D02202 and D02012 at 5 and 19% 
mean heterozygosity, respectively. The number of repeats in the reference done 
in these loci are 12 and 15. respectively. Correlation analysis did not reveal any 
significant linear relationship between the number of repeats at a locus and its 

15 overall observed heterozygosity (p=0.22). 

Figures 2A-2D show the results from typical gels used to evaluate the alleles 
in gathering the data as described above. Amplification products of DNA from 
various different breeds at the locus D02011 are shown. Figures 2A-2D represent 
different gets, run under similar conditions. Note that the molecular weight marker 

20 identified in lanes marked M is the 246 bp band of the 123 bp ladder (Gibco-BRL, 
Gaithersburg, MD). The size of the amplification product in the reference clone was 
238. The different alleles are easily identified, with PCR products separating in 
sharp and well resolved bands, near and below the 246 bp marker. Some 
non-specific amplification products can be observed, especially in cases with higher 

25 template DNA concentrations; however, these do not interfere with correct typing. 

The results indicate that microsateliite loci containing CA repeats are 
abundant and highly polymorphic markers for the canine genome. These findings 
indicate that such markers hold great potential for use as linked markers for genetic 
defects in pure bred dogs. 

30 The estimate that there is one useful CA repeat every 31 kb in the canine 

genome is in good agreement with one every 42 kb estimated recently by others 
(Rothuzien, J. et al.. Theor App. Genet 89:403-406 (1994)). In the above-described 
study, a secondary screening was carried out and only very strong hybridization 
signals were accepted as positive, which resulted in elimination of about 20% of the 

35 primary positives. It thus appears that the estimate of the minimal CA microsatellites 
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frequency in the canine genome is accurate. These estimates have practical 
implications particulariy, since most cosmids have insert sizes in the 30-40 kt> range, 
the likelihood of finding a useful CA repeat in a cosimd clone harboring a gene of 
interest is high. 
5 SPECIFIC EXAMPLE 11 

Materials and Methods 
Patients and pedigrees. The patients and pedigrees used were primarily 
those used and described eariier (Yuzbasiyan-Gurkan, V. etal., Genomics 15:86-90 
(1993)). Briefly, pedigrees of American Kennel Club registered Bedlington terriers 

10 were associated with the help of Bedlington terrier (BT) breeders. While all of the 
pedigrees have a family history of CT, not all had a symptomatic proband at the time 
of pedigree ascertainment. Diagnosis of dogs as to whether they were affected or 
unaffected with OT was made in ail cases by quantitative copper assay from liver 
biopsies perfomned at 1 year of age or older by criteria eariier described. DNA was 

15 extracted from peripheral blood samples collected in acid-citrate-dextrose as 
anticoagulant as described (Yuzbasiyan-Gurkan, V. et al., Genomics 15:86-90 
(1993)). 

Microsatelfite analysis. The microsatellite markers used in this study were 
developed as described in Specific Example !. Standard conditions used to amplify 

20 each marker tocus in polymerase chain reactions (PGR) were as follows: 25-50 ng 
of genomic DNA as template in 25 p\ of PGR buffer (50 mM Tris HCI. pH 8.3 @ 
25'*C, 50 mM KOI, 13 mM MgCtj). 200 //M dNTPs, 200 pM with respect to each 
primer and 1.5 U of Taq DNA polymerase. A touchdown PGR protocol (Don. R.H. 
et aL, Nucleic Acids Res, 19:4008 (1991)) was established to facilitate the robust 

25 amplification of most markers under the same conditions. PGR was carried out at 
94'*C for 45 sec, 52*C for 30 sec, and 72"G for 1 min. 

The microsatellite markers were initially evaluated in ten sets of parents from 
the BT pedigrees. Those markers for which at least one parent was heterozygous 
were then evaluated in all the dogs in the pedigree. Seven to twelve microliters of 

30 pnaduct were run on a 5% to 7% Hydrolink D600 acrylamide horizontal gel according 
to the manufacturer's instructions with the following modification. During the 
overnight runs, a plexiglas gel carrier was placed on top of the gel to prevent the 
swelling and distortion that was othenwise observed. Initially, electrophoresis was 
canried out from 4 to 5 hr. at 50 V in 1 X TEE (90 mM Tris, pH 8.3, 90 mM boric 

35 acid, 2 mM EDTA) with ethidium bromide. A photograph was taken and the gel 
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etectrophoresis then continued overnight at 35-40 volts depending on the fragment 
size of the product, A second photograph was taken and the results visually 
evaluated. It was found that two photographs were helpful in comparing different 
dogs with similar patterns. The alleles were then tabulated and used rn linkage 
analysis. 

Linkage analysis. Two point LOD (logarithm of odds) scores between CT and 
all the markers tested were generated using the MLINK program of the LINKAGE 
package (v5.1) (Lathrop, G.M. et aL, PNAS (USA) 81:3443-3446 (1984)). A gene 
frequency of 0.5 was assumed for CT. 

Results 

Two hundred thirteen microsatellite markers were evaluated in the process 
of finding linkage. Of these 213 markers, 181 provided scorabie products in BTs 
using the touchdown protocol described above. Of these» 114 were informative in 
the pedigrees and were further evaluated. 

Of all the markers tested for linkage to CT, only one yielded a significant LOD 
score. As shown in Table 5 below, marker number C04107 was found to be linked 
to the CT locus at a LOD score of 5.96. at a recombination fraction of zero. No 
recombinants were detected. Since a LOD score of 5.96 indicates that the odds of 
observing this linkage by chance is about 1 in a million, and since, a LOD score of 
greater than 3 or an odds ratio of 1 tn 1000 is considered proof of linkage, the 
findings imply that the CT locus is indeed very close to the C04107 locus and thus 
can be used to predict the inheritance of alleles at the CT locus. No recombinants 
were detected in this study and thus a value can not be put on the genetic distance 
between these loci, except to say that they are very close. 

Table 5 



e 

(Recombin- 
ation 
Fraction): 


0.0 


0.001 


0.01 


0.05 


0.15 


0.1 


0.2 


0.3 


C04107 vs. 
CT 


5.96 


5.95 


5.85 


5.38 


4.78 


4.14 


3.49 


2.13 


C04107 vs. 
ESD 


-oo 


-19.73 


-10.78 


-4.77 


-2.44 


-1.28 


-0.6 


-0.01 


C04107 vs. 
RBI 


-oo 


-20.35 


-11.43 


-5.47 


-3.18 


-2.01 


-1.28 


-0.47 



30 



35 
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The primer sequence and allele information about this marker are shown in 
Table 6. The allele frequencies were determined from alleles obsen^ed in apparently 
unrelated dogs. 

Table 6 



Marker Locus 


C04107 


Repeat Motif in Reference Clone 


(CA)e CT(CA),, 


Primer Pair 


TCAGCAACTATACATTTAAGAGGA 
CTGTCCCATCTAAAGGATAGG 


Allele 1 and Frequency 


163 bp. 0.39 


AHele 2 and Frequency 


167 bp, 0,61 



Marker C04107 was used to locate markers C04107B and C04107C shown 
in Table 2A» which are close to C04107 and also contain repeats. This "family" of 
markers may be used to detect CT. 

A typical pedigree illustrating linkage to C04107 is shown in Figure 3. In 
Figure 3, circles and squares depict females and males, respectively^ and individuals 
affected with CT are indicated by the filled symbols. The asterisk in the figure 
indicates an individual not available for analysis. The bands are the negative image 
of amplification products obtained from the dogs indicated in the pedigree and 
analyzed individuals share the 2,2 genotype at this locus. In this pedigree, all dogs 
with the 1.1 genotype are predicted to be homozygous normal while those with the 
1 .2 genotype are predicted to be heterozygous, and thus carriers of the CT gene. 

Given the finding of linkage and allowing for a small error for recombination. 
It is predicted that all the offspring with the 1, 1 genotype are clear of the CT gene 
/.e., homozygous normal, and that all 1,2 offspring are carriers in this pedigree. 

Since data on the ESD and RB1 loci were available for most of the dogs from 
a previous study (Yuzbasiyan-Gurican, V. et aL. Genomics 15:86-90 (1993)). the 
linkage relationships of these loci with C04107 were was also evaluated. Neither 
ESD or RBI were found to closely linked to C04107 (see Table 5). 

As demonstrated by the pedigree illustrated in Figure 3. given an infomiative 
mating, it is now possible to identify all the genotypes in the offspring, distinguishing 
between the homozygous normal, homozygous affected and heterozygous dogs 
provided the genotype of one affected dog is available. However, C04107 is not 
extremely polymorphic in the BT population, showing only two alleles and a 
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calculated heterozygosity of 0.43. Therefore, typing at the C04107 will not always 
yield information about the CT status of the offepring. Thus far, all affected dogs 
have been of the 2,2 genotype and the 2 allele is more common than the 1 allele 
(see Table 6). The matings which produce affected dogs will be found to be either 
5 between parents who are both 2,2 both 1 .2 or one 1 ,2 and the other 2,2. In such 
cases, typing at the CCM107 locus will only be useful in the second and third mating 
types. In the latter mating pairs, predictive information would only be available as 
to which dogs are affected. In order to make most pedigrees In the breed 
infomnative, additional polymorphic markers closely linked toC04107 are developed. 
10 It is predicted that a battery of three to five highly polymorphic markers will make 
almost every pedigree informative. 

If strong linkage disequilibrium occurs at C04107 or nearby loci» the predictive 
power will be substantially improved. However, further studies of allele distributions 
in the BT population are needed to evaluate linkage disequilibrium, tn any case, it 
15 should be possible to dramatically reduce the frequency of this serious disease 
within a very few generations. 

As discussed above, canine copper toxicosis is present in the West Highland 
White Terrier and perhaps in several other breeds. (Thomburg. L.P. et al.. Vet 
Pathol, 27:81-88 (1990)). in the West Highland Tenier, it Is dear that the phenotype 
20 is more complex, in that there is a spectrum of liver copper levels. This marker is 
evaluated in the West Highland White Terrier breed and it is detennined whether 
there is segregation of high liver copper values with C04107. 

The foregoing discussion discloses and describes merely exemplary 
embodiments of the present invention. One skilled in the art will readily recognize 
25 from such discussion and from the accompanying claims and drawings, that various 
changes, modifications and variations can be made therein without departing from 
the spirit and scope of the invention. 

All publications referred to herein are expressly incorporated by reference. 
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WE CLAIM: 

1 . A primer comprising a polynucleotide, wherein the polynucleotide has 
a sequence selected from the group consisting of the sequences of Table 2A. 

2. The primer of Claim 1 . wherein the sequence is the Sns sequence of 
5 mari<er focus C04107 of Table 2A- 

3. The primer of Claim 1 , wherein the sequence is the Asn sequence of 
marker locus C04107 of Table 2A. 

4. The primer of Claim 1 , wherein the sequence is the Sns sequence of 
the marker locus 0041 07B of Table 2A. 

10 5. The primer of Claim 1, wherein the sequence is the Asn sequence of 

the mariner locus C04107B of Table 2A. 

6. A method for amplifying DNA, comprising the step of performing PGR 
with the DNA and a primer set selected from the group consisting of the primer sets 
of Table 2A, 

15 7. The method of Claim 6, wherein the primer set is that shown as the 

Sns sequence and Asn sequence of the marker locus C04107 of Table 2A. 



8. The method of Claim 6. wherein the primer set is that shown as the 
Sns sequence and Asn sequence of the marker locus C04107B of Table 2A. 
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